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Advanced In Silico Tools for Designing 
of Antigenic Epitope as Potential Vaccine 
Candidates Against Coronav irus 



Mehak Dangi, Rinku Kumari, Bharat Singh, and Anil Kumar Chhillar 


15.1 Introduction 

Coronaviruses are remarkably large, positive-stranded RNA viruses that are 
enveloped with the nucleocapsid having helical symmetry. The corona in coronavi- 
rus is a Latin word that means a “crown”, and it indicates to the typical presentation 
of virions underneath electron microscopy with a periphery of hefty, globular surface 
projections similar to that of a crown. Coronavirus is a pathogen associated with 
severe respiratory symptoms and was first identified from the nasal cavities of 
sufferers with the common cold in the early 1960s (de Groot et al. 2013; Brown 
et al. 2012). These were named human coronavirus OC43 and human coronavirus 
229E. A total of 40 sequenced genomes of different strains of coronavirus are 
accessible from National Center for Biotechnology Information (NCBI), out of 
which 7 are pathogenic to humans. A coronavirus, i.e. SARS-CoV, was responsible 
for outbreak of severe acute respiratory syndrome (SARS) in the year 2003, whereas 
Middle East respiratory syndrome coronavirus (MERS-CoV) caused the most recent 
outbreak in 2012 causing acute respiratory disease in affected people with signs of 
fever, cough and difficulty in breathing. After first reported from Saudi Arabia in 
2012, this novel virus has also dispersed to other countries like the United States and 
was known to have high death rate. MERS-CoV infections are highly communica¬ 
ble, and no explicit antiviral cure has been designed for it till date (Azhar et al. 2017). 
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It compelled us to apply the well-known reverse vaccinology (RV) approach on 
available proteome of coronavirus. RV approach has been successfully applied on 
many prokaryotes, but there are very few known applications on eukaryotes and 
viruses. So, it is worthwhile to explore the potential of this approach to identify 
potential vaccine candidates for coronavirus. RV basically does the in silico exami¬ 
nation of the viral proteome to hunt antigenic and surface-exposed proteins. This 
approach was initially applied successfully to Neisseria meningitidis serogroup B 
(Kelly and Rappuoli 2005) against which none of the prevailing techniques could 
develop a vaccine. The present book chapter is intended to explore the potential of 
RV approach to select the probable vaccine candidates against coronavirus and 
validate the results using docking studies. 


15.2 The Elementary Concept of Reverse Vaccinology 

Undoubtedly, the traditional approaches for vaccine development are fortunate 
enough to efficiently resist the alarming pathogenic diseases of its time. However, 
the traditional approach suffers from certain limitations like it is very time- 
consuming, the pathogens which can’t be cultivated in the lab conditions are out 
of reach, and certain non-abundant proteins are not accessible using this approach 
(Rappuoli 2000). Consequently, a number of pathogenic diseases are left without 
any vaccine against them. All these limitations are conquered by reverse 
vaccinology approach utilizing genome sequence information which ultimately is 
translated into proteins. Hence all the proteins expressed by the genome are accessi¬ 
ble irrespective of their abundance, conditions in which they expressed. The credit of 
fame of reverse vaccinology should go to the advancements in the sequencing 
strategies worldwide. Accordingly, improvement in the sequencing technologies 
has flooded the genome databases with huge amount of data which can be computa¬ 
tionally undertaken to reveal the various crucial aspects of the virulence factors of 
the concerned pathogen. Reverse vaccinology is based on same approach of com¬ 
putationally analysing the genome of pathogen and proceeds step by step to ulti¬ 
mately identify the highly antigenic, secreted proteins with high epitope densities. 
The best epitopes are selected as potential vaccine candidates (Pizza et al. 2000). 
This approach has brought the unapproachable pathogens of interest in spotlight and 
is evolving as the most reassuring tool for precise selection of vaccine candidates and 
brought the use of peptide vaccines in trend (Sette and Rappuoli 2010; 
Kanampalliwar et al. 2013). 


15.3 Successful Applications of Reverse Vaccinology 

Bexsero is the first universal serogroup B meningococcal vaccine developed using 
RV, and it has currently earned positive judgement from the European Medicines 
Agency (Gabutti 2014). Whether it is discovery of pili in gram-positive pathogens 
which were thought to not have any pili or the sighting of factor G-binding protein in 
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meningococcus (Alessandro and Rino 2010), the reverse vaccinology steals all the 
credits from other conventional approaches. Most of the applications of RV are 
against prokaryotes and very few against eukaryotes and viruses because of com¬ 
plexity of their genome. Corynebacterium urealyticum (Guimaraes et al. 2015), 
Mycobacterium tuberculosis (Monterrubio-Lopez et al. 2015), H. pylori (Naz et al. 
2015), Acinetobacter baumannii (Chiang et al. 2015), Rickettsia prowazekii (Caro- 
Gomez et al. 2014), Neospora caninum (Goodswen et al. 2014) and Brucella 
melitensis (Vishnu et al. 2017) are the examples of some pathogens that are recently 
approached using this in silico technique in order to spot some epitopes having 
potential of being a vaccine candidate. Herpesviridae (Bruno et al. 2015) and 
hepatitis C virus (HCV) (Kolesanova et al. 2015) are the examples of the viruses 
that are addressed using this approach. 


15.4 Workflow of Reverse Vaccinology (with Example 
of Coronavirus) 

15.4.1 Retrieval of Proteome of Different Strains of Coronavirus 
from NCBI 

The proteome of different strains of the coronavirus of interest was downloaded from 
NCBI’s ftp site (ftp://ftp.ncbi.nlm.nih.gov/genomes/Viruses/; NCBI Resource 
Coordinators 2017). The proteome information is available for download in many 
formats including FASTA format for different sequenced viruses. Strains pathogenic 
to humans were selected for further analysis. Among them a single strain was 
selected as the seed genome on the basis of literature. Sequence similarity searches 
using Blastp (http://blast.ncbi.nlm.nih.gov/blast, http://ugene.unipro.ru/) were 
performed to reveal the orthologs in different strains (Altschul et al. 1990; 
Okonechnikov et al. 2012; Golosova et al. 2014). Multiple sequence alignment 
(MSA) was done via ClustalW, and the phylogenetic tree was constructed using 
NJ method from Unipro UGENE 1.16.1 bioinformatics toolkit (Okonechnikov et al. 
2012 ). 


15.4.2 Analysis of Secondary Structure of Proteins from Seed 
Genome 

Analysis of secondary structure of the proteins of seed genome was done by means 
of ExPASy portal. The aim is to forecast the solvent accessibility, instability index, 
theoretical pi, molecular weight, grand average of hydropathicity (GRAVY), ali¬ 
phatic index, number of charged residues, extinction coefficient etc. (http://web. 
expasy.org/protparam/; Gasteiger et al. 2005). 




332 


M. Dangi et al. 


15.4.3 Subcellular Localization Predictions and Count 
of Transmembrane Helices 

Virus-mPLoc was used to identify the localization of proteins of virus in the infected 
cells of host (http://www.csbio.sjtu.edu.cn/bioinf/virus-multi/; Hong-Bin Shen and 
Kuo-Chin Chou 2010). This information is important to understand the destructive 
role and mechanism of the viral proteins in causing the disease. In total six different 
subcellular locations, namely, host cytoplasm, viral capsid, host plasma membrane, 
host nucleus, host endoplasmic reticulum and secreted proteins, were covered. These 
predictions could help in formulation of better therapeutic options against the virus. 
As per the protocol of RV, secreted and membrane proteins are of special interest, 
therefore, filtered for further analysis. To predict the number of transmembrane 
helices TMHMM Server v. 2.0 (http://www.cbs.dtu.dk/services/TMHMM/; Krogh 
et al. 2001) was used. 


15.4.4 Signal Peptides 

Signal peptides are known to impact the immune responses and possess high epitope 
densities. Moreover, most of the known vaccine candidates also possess signal 
peptides. Hence, it is worthwhile to predict signal peptides in proteins prior to 
epitope predictions. Signal-BLAST web server is used to predict the signal peptides 
without any false predictions (http://sigpep.services.came.sbg.ac.at/signalblast.html; 
Frank and Sippl 2008). The prediction options include best sensitivity, balanced 
prediction, best specificity and detect cleavage site only. We choose to make the 
predictions using each option, and the proteins predicted as signal peptide by all the 
four options were preferred for further investigation. 


15.4.5 Adhesion Probability 

The most appropriate targets as vaccine candidates are those which possess the 
adhesion-like properties because they not only mediate the adhesion of pathogen’s 
proteins with cells of host but also facilitate transmission of virus. Adhesions are 
known to be crucial for virulence and are located on surface which makes them 
promptly approachable to antibodies. The stand-alone SPA AN with a sensitivity of 
89% and specificity of 100% was used to carry out the adhesion probability 
predictions, and the proteins with having adhesion probabilities higher than or 
equal to 0.4 were selected (Sachdeva et al. 2004). 


15.4.6 BetaWrap Motifs 

BetaWrap motifs are dominant in virulence factors of the pathogens. If the proteins 
are predicted to possess such motifs, then they are appropriate to be taken under 
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reverse vaccinology studies. BetaWrap server is the only online web server to make 
such predictions. The proteins having P-value lower than 0.1 were anticipated to 
contain BetaWraps (http://groups.csail.mit.edu/cb/betawrap/betawrap.html; Bradley 
et al. 2001). 


15.4.7 Antigenicity Predictions 

For added identification of the antigenic likely of the proteins, they were subjected to 
VaxiJen server version 2.0. It is basically an empirical method to hunt antigenic 
proteins. So, if the proteins are not found antigenic using other sequence-based 
methods, then they can be identified using this method. This step confirms the 
antigenicity of proteins selected using above-mentioned steps (http://www.ddg- 
pharmfac.net/vaxijen/VaxiJen/VaxiJen.html; Doytchinova and Flower 2007). 


15.4.8 Allergenicity Predictions 

For being a probable vaccine candidate, the protein should not exhibit the 
characteristics of an allergen as they trigger the type-1 hypersensitivity reactions 
causing allergy. Therefore, to escape out such possibilities, the proteins were also 
subjected to allergenicity predictions using Allertop (http://www.pharmfac.net/ 
allertop; Dimitrov et al. 2014) and AlgPred tools (http://www.imtech.res.in/ 
raghava/algpred/submission.html; Saha and Raghava 2006a, b). 


15.4.9 Similarity with Host Proteins 

To check whether the filtered proteins possess any similarity to host proteins or not, 
the standard Blastp (http://blast.ncbi.nlm.nih.gov/blast) searches were performed. In 
case of sequence similarity, there is a feasibility of generation of immune responses 
against own cells. 


15.4.10 Epitope Mapping 

Predicting the epitopes binding to MHC class I is the main decisive phase of the 
RV to carry out valid vaccine predictions. The epitopes showing their affinity for 
T-cells were first selected via IEDB (http://tools.immuneepitope.org/mhci/), 
ProPred-I (http://www.imtech.res.in/raghava/propredl/; Singh and Raghava 
2003), BIMAS (http://www-bimas.cit.nih.gov/molbio/hla_bind/; Parker et al. 
1994) and NetCTL tools (http://www.cbs.dtu.dk/services/NetCTL/; Larsen et al. 
2005). For the epitope to be included in the hit list, it must be predicted by any 
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three of these four mentioned tools. For making the predictions of B-cell epitopes, 
BepiPred (http://www.cbs.dtu.dk/services/BepiPred/; Larsen et al. 2006) and 
ABCPred tools (http://www.imtech.res.in/raghava/abcpred/ABC_submission. 
html; Saha and Raghava 2006a, b) were used. The overlapping B-cell and T-cell 
epitopes were identified. 


15.4.11 Docking of the Predicted Epitopes with HLA-A*0201 

The predicted epitopes were docked with receptor that is HLA-A*0201 using 
ClusPro (http://cluspro.bu.edu/login.php; Kozakov et al. 2017) that is an automated 
protein-protein docking web server. The literature searches provided the information 
of conserved residues of the receptor site. The default parameters were used for 
docking (Comeau et al. 2004a, b; Kozakov et al. 2006). 


15.5 Results and Discussion 
15.5.1 Retrieval of Proteome from NCBI 

A total of 40 different sequenced strains of coronavirus are available at NCBI. 
Among them 7 strains are pathogenic to humans. Various information regarding 
source, host and collection of these strains are presented in Table 15.1 and 15.2. This 
information can be obtained from NCBTs genome database, the Virus Pathogen 
Database and Analysis Resource and Genomes OnLine Database (Liolios et al. 
2006; Pickett et al. 2012). The MERS strain is taken as seed genome as it is the 
most prevalent and disastrous strain among others. Its proteome consists of total 
11 proteins as shown in Table 15.3. The results of sequence similarity to reveal 
orthologs using Blastp are shown in Table 15.4. The sequences with greater than 
30% identity score are considered as homologs. The phylogenetic tree is depicted in 
Fig. 15.1 and the MERS-CoV, taken as seed genome, found clustered with different 
Bat coronaviruses. 


15.5.2 Analysis of Secondary Structure 

The results of analysis of secondary structure of the proteome using ExPASy tools 
are shown in the Table 15.5. From the analysis of charge on the residues and pH 
values, it is concluded that six of the proteins are basic and positively charged unlike 
allergens which are acidic in nature. However, five proteins are acidic and show 
negative charge. The negative GRAVY score of five proteins justify them to be of 
hydrophilic nature with majority of the residues positioned towards the surface. For 
the rest of six proteins, the GRAVY score is positive; it means that these are 



Table 15.1 Information of coronavirus strains available at NCBI 


15 Advanced In Silico Tools for Designing of Antigenic Epitope as Potential.. 


335 


C/3 

g 


4—1 

o CD 


No 

proi 

as 

as 

o 

H 

CN 

H 

H 

H 

as 

r- 

oo 

oo 

as 

oo 

o 

H 


as 

o 

so 

oo 

as 

o 

o 

H 

CN 

CO 

-er¬ 

uo 


G" 

uo 

CO 

-er¬ 

-er¬ 

r- 

as 

as 

as 

as 

as 

as 

% 

uo 

uo 

as 

as 

as 

-e|- 

as 

as 

as 

as 

as 

as 

CD 

H 

H 

CN 

CN 

CN 

G- 

so 

so 

so 

so 

SO 

so 

G 

H 

H 

H 

H 

H 

H 

H 

H 

H 

H 

H 

H 

G 

O 

O 

1 

O 

1 

o 

1 

o 

1 

o 

1 

O 

1 

o 

I 

o 

1 

o 

1 

o 

1 

o 

| 

o 

I 

G 

CD 

O 

O 

U 

U 

U 

u 

U 

U 

O 

O 

U 

U 

o 

z 

z 

z 

Z 

z 

z 

z 

z 

z 

z 

z 

z 



C/3 

g 

c3 

& 

C/3 

C/3 


> 

cd 

a 

o 

o 

U 


C/3 

2 

> 

cd 

n O 

8 o 
8 ^ 
-a a 

I ^ 

H PQ 


C/3 


> 

cd 

G 

O 

5-4 

o 

o 

cd 

• rH 

G 

G 


uo 

co 

I 

co 

H 

g 

PP 


5-H 

CD 

44 


Dh 

C/3 

b 

> 

cd 

a 

2 

o 

o 

td 

& 


r- 
>3 oo 

5 H —H 

1 K 

.s < 

C4 c/3 

C/3 

CD 
5-1 


CD 

a 

• i-H 
> 
o 


2 

> 

cd 

G 

O 

Sh 

o 


PQ o 


PC 

o 

c/5 

5 

b c 
o •£ 

CS O 
.U rO \D 

Ph c/3 OS 
c/3 — 

CD c 
•> 

.£ G 

> 2 o 
PQ 8 5 


as 

r-H 

U 

H 


CO 

I 

oo 

-d- 


PQ 

C/3 

s 

> 

G OO 

2 ° 
O 
o 


o 

CN 

OQ 

3 o 

PQ PQ 


uo 

H 

g 

cc 

C/5 

b 

> 

cd 

G 

o 

5-4 

o 

o 

CD 

G 

o 

5-1 

o 

Oh 


C/3 

2 

> 

cd 

G 

O 

5-H 

o 

CD 

CD 

CD SO 
CD Z 

.'G Z> 
-G sy 

£ PC 


t-- 

H 

Q3 

PQ 


C/5 

s 

> 

cd 

G 

8 

o 

CD 

£ 

1 

CG 

00 


c/3 

8 

> 

cd 

G 

O 

^H 

o 

o 


-P 

o 

^H 


HD OO 
£4 1 —\ 

w) H 
=3 PQ 

S x 


C/5 

2 

> 

cd 

G 

O 

5-H 

o 

CD 

G 

O 

5-1 

CD 

-G 


as 


5 P^ 
.SPPQ 
Z PC 


o 

CN 

g 

PP 

C/5 

2 

> 

cd 

G 

O 

5-H 

o 

o 

G 

O 

CD 

OX) 


T3 _ 
<D -G 

4-> C/3 

JJ | | 

G Q r 

h£ h 


cd 

• r-H 

G 

G 


cd 

Q< 


CD 

G 

> 

O 

PQ 


CD 

G 

> 

O 

PQ 


cd 

PQ 


CD 

G 

CD 

5-H 

o 

Oh 


CD 


T3 

^H 


-P § 

-G CD ' G 

;> >3 P4 

t> CD 00 


CD 

• i-H 

04 - 

OX) .G 
cd X) 

2 2 


£ g 
00 g 

g 5 


G 

O 

CD 

OX) 


o 

G 

• r-H 

00 CN 


CN 

CN 


CO 

CN 


-d- 

CN 


uo 

CN 


SO 

CN 


r- 

CN 


oo 

CN 


as 

CN 


O 

CO 


CO 


CN 

CO 


C/3 

O '§ 

8 2 oo 
Z pH 


as 


so 


oo 


CN 


CN 


oo 


as 


as 


oo 



UO 

oo 

r- 


r- 

o 

CO 

uo 

uo 

as 

o 

H 


-et- 

H 

-eT 

CO 

r- 

uo 

-ef 

-d- 

H 

H 

CN 

CN 

so 

r- 


oo 

uo 

as 

oo 

o 

CO 

o 

O 

O 

CD 

CN 

-t 

uo 

uo 

so 

CN 

as 

CO 

oo 

as 

as 

Os 

G 

O 

o 

o 

o 

O 

H 

H 

o 

o 

o 

o 

O 

O 

o 

o 

© 

© 

© 

o 

i 

o 

i 

© 

© 

© 

© 

o 

G 

CD 

U 

U 

U 

u 

U 

U 

U 

U 

U 

U 

U 

U 

o 

Z 

Z 

Z 

z 

z 

z 

z 

Z 

z 

z 

z 

z 


CD 


c/3 

G 

cd 

& 

C/3 

C/3 


> 

cd 

G 

o 

5-H 

O 

U 


C/3 

8 

> 

cd 

G 

O 

5-1 

o 

CD 

G 

cd 


G 

PP 


C/3 

8 

> 

cd 

G 

8 

o 

CD 

w S 

as 5 
CN < 
CN 00 


C/5 

8 

> 

cd 

G 

O 

S-h 

O 

o 

G 

G CO 

s ^l - 
o 
o 


G 

PP 


C/5 

2 

> 

co 

G 

O 

5-4 

o 

CD 

G 

03 


G 

PP 


CO 

so 

H-l 

z 


C/5 

2 

> 

cO 

G 

O 

o 

o 

G 

03 


G 

PC 


g 

PP 


CD 

'£P 

CD 

G 

CD 

G 

cO 


G 

PC 



> 

o 

O 

4—> 

PQ 


03 

-P 

C/3 

G 


8 


C/5 


*o 

g 

PC 

C/3 


> 
03 

G 

§ 


03 

-P 

C/3 

G 


CD 

C/3 

G 

____ Q I O 

Oh 8 04 


Infected 

host 

Human 

Human 

Human 

Human 

Human 

Human 

Human 

Bovine 

Bat 

Bat 

Bat 

Bat 

o 













G 













• 










o 

H 

CN 

in 

H 

CN 

CO 

-d- 

UO 

so 

r- 

oo 

Os 

H 

H 

H 


Os 

g 

PP 

C/3 


> 

03 

G 

O 

o 

CD 



336 


M. Dangi et al 


C/3 

g 


4—1 *fH 

o <D 


a 

CD 

OV 

H 

H 

On 

r- 

<N 

H 

0 

H 

On 

0 

H 


VO 

CO 

H 

CO 

CO 

0 

r- 

H 


On 

00 

r- 

0 

G - 

VO 

H 

y—i 


OV 

O 

00 

H 

VD 

r- 

CN 

O 

CD 

VO 

r- 

00 

CN 

CN 

CO 

>n 

VO 

G 

r-H 

r-H 

H 

<N 

CN 

CN 

CN 

CN 

O 

O 

1 

0 

1 

0 

1 

O 

1 

O 

1 

0 

1 

O 

1 

O 

1 

G 

CD 

O 

U 

U 

O 

U 

0 

O 

O 

O 

Z 

z 

z 

z 

z 

z 

z 

z 



c/3 

g 

3 

-b 

C/3 

C/3 


> 

CCS 

a 

o 

Dh 

o 

U 


s 3 

-s d 

§ K 
S 3 
c .5 I R 


1 | 
§ o 

u 8 


w 

ffi 

C/3 

S 

> 

ccS 

g 

o 


o 


ccS 

P< 


C/3 

s 

> 

ccS 

G 

S 

o 

o 

■4—> 

ccS 

DC 

C/3 

3 

Id 

C/3 

=3 

O 

P< 


in 

H 

W 

DC 

Ph 

Q 

U 

C/3 

B 


> 

G vo 


5 

DC 


g 

a 

o 

o 


o 
o 

CN 

< 

c5 <zi 

PQ P 


C/3 

=3 

CD 

CD 

ccS 

G 

•sc 

w 

C/3 

a 

> 

ccS 

G 

a 

o 

o 

ccS 

■4—> 

CD 

PQ 


73 

CD 

<£ I 

G Q 
3 -G 


§ 

•s 

O 

o 


-O 

•s 

oi 


ccS 

PQ 


ccS 

PQ 


>3 

s 

CD 

CD 

<3 

S 
• ^ 

£3 


o 

G 

00 


CO 

CO 


G" 

CO 


uo 

CO 


VO 

CO 


r- 

co 


c/3 

G 


ccS 

3 

C/3 


<N 

H 

o > 

G 


a 


G 

O 


CO 


Eal g r- 

Q SQ 

U 04 

S -5 0 
> S £ 


C/3 

a 

> 
ccS 
G 
O 

$-H 

O 
CD 

3 o 

-9 t>xj 

& § 

w :3b 

4-> Or _ 

c3 ^3 cd 

PQ N PQ 


<N 

s 

DC 

C/3 

a 

> 

ccS 

G 

a 

o 

CD 

ccS 


04 

G 


s 


C/3 

a !§ 

PQ PQ 


00 On O 

CO CO G" 


o CD 

-4—> 

o a ^ 00 r- o 00 I g- 

Q 



r- 

00 

r- 

VO 

r- 

00 

VO 

0 

=tfc 

in 

00 

CN 

CO 

co 

CO 

G - 

0 

VO 

On 

CO 

G- 

G- 

G- 

vo 

00 

<D 

On 

On 

0 

O 

O 

O 

0 

0 

G 

O 

O 

H 

H 

H 

H 

H 

H 

5— 

O 

O 

1 

O 

1 

O 

1 

0 

1 

0 

1 

0 

1 

0 

1 

0 

t 

G 

CD 

u 

u 

U 

O 

O 

O 

O 

U 

0 

z 

z 

z 

Z 

Z 

Z 

Z 

z 


C/3 

G 

a 

C/3 

C/3 


> 

ccS 

G 

O 

Dh 

O 


<N 

G 

-o n 

C/3 C/3 

£ 2 

^ > 
Gh g 

o 


o 

CD 


CN 
G S 

^ DC 

C/3 

G 
-G 
CD 
O 


C/3 


G 

O 

Dh 

O 


U M O 


o 

G 

-G 

Q< 


C/3 

a 

> 

ccS 

G 

O 

Vh 

O 

o 

CD 

G 

G 

cr 


CD W 


PQ 


C/3 

a 

§ 

G 

O 

Dh 

O 

o 

■4—> 

ccS 

PQ 


< 


C/3 

a 

§ 

G 

O 

Dh 

O 
CD 
■4—> 

ccS 

PQ 


00 

a § 
a 

23 c/3 

a g 

CD -G 

■4—4 >> 

9« G 

.a c 
.2 a 
S 8 


JD 00 
G C/3 

f B 

« > 
S 

-3 o 

OD g 

PQ 8 


C/3 


> 

CCS 

G 

O 

Dh 

O 

CD 

kO 

CD 

* 

G 

H 


Infected 

host 

Bat 

Bat 

Equine 

Bat 

Bat 

Bat 

Whale 

Turkey 

d 









G 









. 

CO 

G" 

in 

VO 

r- 

00 

On 

0 

00 

H 

H 

H 

H 

H 

H 

H 

CN 



Table 15.2 Detail information about seven strains of coronavirus which are pathogenic to humans 


u 

o # 


co 

00 

co 


oo 

o 

3 - 


oo 

X 

co 


in 

3- 

co 


(N 

co 


3 

o 

*x 

aj 

& 

I 8 

U X 


X 

O 

o 

(N 

CN 

o 


x 

o 

o 

CN 

CN 

o 


x 

o 

o 

<N 

cn 

o 


X 

O 

o 

CN 

cn 

d 


x 

o 

o 

OJ 

cn 

o 


c/5 

o 

ffi 


o 

3 

o 

X 

3 


X 

O 

X 

On 


X 

Os 

CN 

d 

Os 


so 

o 

VO 

Os 


so 

O 

O 

On 


SO 

O 

SO 

Os 


<D 

— 

a « 

3 Oh 

u d 


& 

o 

C/5 


<D 

•4—» 

c3 

O 

C/5 


<D 

-j—> 

o 

C/5 


X 

o3 

O 

C/5 


CD 

-j—> 

O 

C/5 


o 

<D 

~ ■g’ 

PQ 

U .2 Q 

Z PQ 0 


co 

os 

3- 


o 

o 

in 

in 


oo 

co 

3- 

in 


o 

VO 

os 

3- 


On 

co 


in 


oo 

3 

• »-H 

o 

3 

<D 

3 

o H 

CU 

cn 


u 

d 

X 

£ 


00 

3 

• »-H 

UJ o 

g 5 


00 

3 


00 

3 


o 

3 


<U aj 

00 GO 


d 

X 

£ 


ao o 

a s 


o 

3 


(U 

3 

cr 


<u cu 

00 GO 


d 

X 

£ 


ao o 

a s 


o 

3 


aj 

3 

cr 


<o aj 

00 GO 


d 

X 

£ 


a 

o 

3 


00 

3 

• i-H 

o 

3 

aj 

3 

cr 


<u aj 

00 GO 


JO 

d 

X 

£ 


00 

s 

• i-H 

o o 

a g 

§ gn 

o u 

00 GO 


oo 

3 

o 

3 

o o 
Sh d 

ST c 

o aj 
cn o 



3 

3 

O 

‘X 

3 

Z 


00 

o 

d 

• t-H 

X) 

o 

H 

o 


S-H 

o 

- 4 —< 

o3 

*h 

O 

•a 

J 


cd 

<D 

ffi 


x> 

Oh 


>-> 3 

s ^ 

O 3 
00 3 

< u 


4h X 

° 5 

d | 'P 

(5 C fi 3 

O O O • rO 

> > > gp 

X 3 3 O 

5 d x © 


d °° 

° 3 X 
>N 5 3 

x d d 

O B ^ 

> GO d 

■a a 


o 


o a ^ 

d < e 


) 


e 

o 

'X 

o 

o 

X o 

o d 

U x 


o 

o 

(N 


co 

o 

o 

(N 


8 o 

o o 

-H (N 


X 

o 

o 

(N 

co 

(N 

co 


oo 

cj S 

(N O 

-h (N 


GO 

o 

ffi 


C/1 

O 

d 

X C 
O cd 

ti g 

« 3 

> X 


GO 

O 

eS 

X 3 
O cd 

ti g 

« 3 

> X 


GO 

O 

d 

X 3 
O cd 

ti g 

« 3 

> X 


GO 

O 

d 

X 3 

O C3 

d g 

« 3 

> X 


GO 

O 

d 

X 3 
O cd 

d g 

« 3 
> X 


GO 

3 

• y-^ 

& 

O 

Vh 

X 


oo 


On 


SO 


OO 


00 

3 

3 

J 


3 

r- 

co 

d 

(N 


in 

r~; 

Os 

(N 


3 

00 

co 

r- 

o' 

co 


co 

in 

in 

d 

(N 


X 

(N 

On 

Os 

(N 



pq 

Os 

(N 

(N 


3 

X 

cn 


(N 

l-H 

o 

H 

aj 

d 

d 

C/1 


On 


3 

T3 


3 

U 

in 

CD 

H 

cn 

3 

HH 

- d 

is 

u 

H 


r-2 

o 

"df 

U 

3 

is 

C/5 

a 

zn 

< 

> 

C/5 

O 

cn 

< HH 


ao -h 

3 ^ 

d 

C/5 hH 

HH HH 


o 

z 

d 

o 

< 


in 

3- 

SO 

(N 

O 

o 

U 

Z, 


oo 

r- 

3" 

O 

o 

U 

Z 


r-~ 

3" 

in 

o 

o 

U 

Z 


co 

oo 

in 

o 

o 

U 

Z 


r- 

r- 

in 

x 

o 

o 

U 

Z 


a 

3 

3 

^3 

*3 

Vh 
-H—* 

cn 


3 

3 

s 


<< 

3 

3 

3 

§ s 

C ON 
Cl IN 
Ci CN 


tn 

d 

tn 


to 

3 

.c 

3 

3 

3 

8 

3 

c> 


3 

3 


to 

3 

.c 

3 

3 

3 

8 

3 

O 


°o 

s 

o 


3 

3 


GO 

3 

.c. 

3 

3 

3 

8 

3 

3 


to 

3 

c 


O 

3 

d 


CN 


co 


in 
























Table 15.2 (continued) 


u 

o # 


r- 

d 


CN 

3" 


3 

_o 

3 

I & 

r 9 3 

U T3 


On 

O 

O 

(N 

d 


o 

CN 

oo 

cn 


C/3 

o 

ffi 


a 

o 

3 

o 

x 

3 


VO 

O 

vO 

CO 


VO 

O 

VO 

On 


3 

— 

3 

-3 3 

3 

U £* 


3 

-H-> 

o 

C/3 


-3 

o 

C/1 


o 

cu 

- ‘s’ 

PQ Oh 

U .2 Q 

Z PQ 9 


D 

o 

t—H 

cn 

D 

cn 

cn 

co 

oo 

cn 

t—H 


oo 

3 

• rH 

3 ^ 

3 w> 

Eh ^ 
cr 3 

3 £3 
C/D go 


^3 

'o 

-3 

£ 


1 

O 

3 


bt) 

3 

• rH 

o 

3 

3 

3 

CT 


bt} 

3 


3 gj 

bt) S 


u 

o 

-3 

£ 


au 3 

a s 


o 

3 


3 

3 

or 


3 a) 
bt) go 


bt) 

3 

• rH 

3 

3 

3 au 

Eh -b 

3 4) 

00 O 



3 

O 


'B 


• rH 

> 

>> ~ "O 

o 

■3 

3 

3 

3 O' au 
3 < .-3 

T3 

bt) 

13 

<D 

ffi 

■HH 

O 

(H 

Oh 

£> n C3 3 

3 

3 


3 

O 

‘■3 

o 

3 

3h 3 

O 3 
U T3 


On 

O 

O 

CN 

cn 


r-~ 


D 


CN 


CN O 
-H CN 


C/3 

o 

ffi 


C/1 

3 

03 

£ s 

au 3 

ti B 

« 3 

> 33 


c/i 

3 

03 

V* 

•8 

e 

3 

> 


GO 

3 

• rH 

3 
-*—» 
O 

}H 

Oh 


CN 


bt) 

3 

3 

hJ 


CD 

I/O 

On 

o' 

cn 


3 

On 


O 

cn 


Source 

information 

Strain:4408 

Strain: 

HCoV- 

EMC 


o 

m 


V) 



CO 

oo 

6 

(N 

CO 

Z 

© 

© 

3 

3 

U 

U 

< 

z 

Z 


a 

3 

3 

- 3 

h 

c/d 


s °9 

V. O 


C/1 

3 


in 


3 


3 

« 



3 

N- 

o 

-H-» 

c3 

3 

O 


3 

s 

3 

g 

© 

© 

• rH 

H 

• rH 

Oh 

CZ3 

© 

3 


O 

3 


s 

a 

^ i 

C/l 


£ 

3 

O 

S-H 


O 

3 

c/i 


VO 


r- 




















15 Advanced In Silico Tools for Designing of Antigenic Epitope as Potential.. 


339 






O 






O 







CD 





O 

• rH 

GO 





o 

O 

4-^ 

o 





• i-H 

CD 

4—* 

o 


O 


<u 

a 

cd 

$h 

• i-H 

CD 
h—> 

O 

Hh 

Oh 

CD 

H—> 

O 

^H 

Oh 

5-h 

Oh 

O 

o 

N> 

O 

00 
4—> 

O 

CD 

+j 

O 

—H 

O 

<D 

4—> 

O 

$—1 

O 

0) 

4—* 

O 

S-H 

Oh 

ID 

Hh 

Oh 

HD 

O 

O 

o 

4—* 

2 

• t-H 

HD 

H 

O 

Hh 

Oh 


fi 

0-> 
4—> 

O 

Oh 

'o 

Oh 

W) 

CD 

O 

Hh 

Oh 

cn 

Oh 

< 

Oh 

PP 

O 

^H 

Oh 

vn 

Oh 

O 

Id 

> 

c3 

Hh 

Oh 

O 

Ph 

Td 

-O 

OO 

P4 


O 

< 
r-H 


• rH 

Cl 

00 

00 

00 

00 

c 

HD 

O 

PP 


Ph 


00 

z 

Z 

Z 

Z 

W 

s 

Z 

o 


-C 













W) 

H 

oo 

cn 

cn 

ON 

NO 

N" 

CN 

On 

cn 

CN 


o 

On 

n 

VO 

o 

o 


CN 

OO 

H 

H 

H 


1) 

cn 

o 

cn 

r-H 

H 

CN 

CN 


CN 


H 


hJ 


n 

r-H 










g 

H 

H 

H 

H 

H 

r-H 

H 

H 

H 

H 

H 


• i-H 

C/5 

C/5 

cn 

CN 

N- 

vn 

NO 

n 

oo 

On 

o 

H 

CN 


o 

O 

O 

o 

o 

o 

O 

O 

H 

H 

H 


CD 

CN 

CN 

CN 

CN 

CN 

CN 

CN 

CN 

CN 

CN 

CN 


CD 

n 

t> 

IN 

IN 

N 

n 

IN 

N 

IN 

IN 

N 


CD 

tj- 

nf 


of 






nf 

xl- 


< 

o 

O 

O 

O 

O 

o 

O 

O 

O 

O 

O 


On 

On 

On 

On 

On 

On 

On 

On 

On 

On 

On 



O 

O 

O 

O 

O 

O 

O 

O 

O 

O 

O 


0) 

O 

O 

O 

O 

O 

O 

O 

O 

O 

O 

O 


4—> 

I 

I 

I 

I 

I 

I 

I 

I 

I 

I 

I 


2 

Ph 

Ph 

Ph 

Ph 

Ph 

Ph 

Ph 

Ph 

Ph 

Ph 

Ph 


Oh 


>h 

H 

H 

>h 

>h 

H 

>< 

>h 

H 

H 



H 

H 

CN 

cn 


vn 

NO 

r- 

OO 

On 

o 


W) 

o3 

hj 

O 

O 

O 

o 

O 

o 

O 

o 

o 

O 

H 


Oh 

Oh 

Oh 

Oh 

Oh 

Oh 

Oh 

Oh 

Oh 

Oh 

Oh 


bD 

T 

W) 

T 

bD 

T 

bD 

T 

bD 

T 

bD 

T 

bD 

T 

bD 

T 

bD 

T 

bD 

T 

bD 

T 


C/5 

oc 

oc 

OC 

OC 

OC 

OC 

OC 

OC 

OC 

OC 

OC 


o 

CN 

CN 

CN 

CN 

CN 

CN 

CN 

CN 

CN 

CN 

CN 


o 

H 

H 

r-H 

r-H 

H 

r-H 

H 

H 


H 

H 


H-l 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 


C/5 

X) 

X) 










C/5 


c3 

c3 



c3 






hO 

CD 

H 



cn 



vn 




oo 

o 

O 

'ip 

'IP 


% 

'IP 

'IP 

'tP 

w 

s 


'IP 

0) 

4—> 

1 hJ 

o 

o 

00 

o 

o 

o 

o 

z 

o 
















5h 













Oh 













cn' 













cn 


N" 


IN 

cn 

H 

cn 


oo 

CN 

N 

o 

oo 

Q 

VO 

H 

r-H 

^|- 

oo 

cn 

H 

cn 

H 

o 

o 

ON 

o 

tj- 

un 

vn 

oo 

y—i 

oo 

vn 

oo 

vn 

OO 

H 

r-H 

4—> 

cn 

r-H 

VO 

vn 

NO 

NO 

N 

N 

oo 

On 

On 

o 

i 

00 

H 

CN 

CN 

CN 

CN 

CN 

CN 

CN 

CN 

CN 

CN 

U 













z 













> 

o 


On 

On 

NO 

CN 

CN 

cn 

O 

O 

cn 

NO 

CN 

u 

tj 

n 

n 

VO 

cn 

vn 

On 

^|- 

On 

vn 

NO 

NO 

a 

n 

CN 


vn 

oo 

O 

OO 

vn 

oo 

vn 

IN 

oo 

4—* 



H 

vn 

vn 

NO 

NO 

N 

N 

oo 

OO 

P< 

W 

00 



CN 

CN 

CN 

CN 

CN 

CN 

CN 

CN 

CN 













s 













4—1 













o 













C 


CN 

CN 


vn 

NO 

1"- 

OO 

On 

O 

H 

vn 

o 


o 

o 

On 

On 

ON 

On 

On 

On 

o 

o 

o 

4 — > 

Q 

NO 

NO 

vn 

vn 

vn 

vn 

vn 

vn 

NO 

NO 

o 

o3 

i—i 




"sf 




"sf 



o 


CD 

VO 

VO 

vn 

vn 

vn 

vn 

vn 

vn 

vn 

vn 

H 

c 

C 

CN 

CN 

CN 

CN 

CN 

CN 

CN 

CN 

CN 

CN 

On 

<s 

O 

O 


"si- 

^t- 





^t- 

"st - 

"st" 

On 

H-H 

C 

r-H 

H 

H 

r-H 

r-H 

H 

H 

H 

H 

H 















ro 













in 


























<u 

o 












Si 

c 










CD 

H 

(9 

1- 

00 

H 

CN 

cn 

N" 

vn 

NO 

N 

oo 

On 

H 

H 




C/5 

3 

00 

G 

• i-H 

C/5 

CD 

a 

o 

G 

CD 

00 

73 

<D 

CD 

C/E 


C/3 

G 

CD 

■ 4 -> 

o 

5-1 

CD 

CD 

_G 


C/3 

<D 

JG 

a 

c3 

CD 

C/3 

00 

o 


o 

X 

C+H 

O 

C/3 

-4-J 

3 

C/3 

CD 

CZ 

in 

Q) 

X2 

(0 


w 

ON 

(N 

(N 


> 

oE 

EE 

o 

iH 

o 

o 

§ 

a 

k 


.5 

£ 

a 

o 

vh 

O 

o 


— 

03 

h—> 

§ 

§ 

a 

k 


£ 

ee 

o 

vh 

O 

o 

§ 

a 

k 


to 

s 

C3 

SC 

s 

O 

O 

SE 

53 


5 


EE 

03 

© 


O 

ee 

ee 

_o 

ir. 

j~. 

03 

03 

03 

< 


ee 

03 

13 


oo 

O 

© 

© 


cE 

H 


Z 

ffi 


m 

M3 

Z 

z 


> 

oE 

Eh 

O 

iH 

o 

03 

S 

a 

s 


on 

3 

O 


C<r3 

3 

.Ch 

'5 

« 

s 

2 

o 

03 

Co 

0= 

Co 


o 

ee 

EE 

_o 

cc 

CZJ 

03 

03 

03 

< 


EE 

03 

© 


O 

ee 

ee 

_o 

'czj 

C/1 

03 

0) 

03 

< 


ee 

03 

13 


O 

EE 

EE 

_o 

C/3 

C/1 

03 

03 

03 

< 


EE 

03 

© 


O 

EE 

Eh 

_o 

C/1 

C/1 

03 

03 

03 

< 


EE 

03 

© 


O 

EE 

EE 

O 

• rH 

C/5 

C/5 

03 

03 

03 

< 


03 

I 

EE 

EE 

• rH 

B 

O 

C-H 

Oh 


O 

EE 

EE 

_o 

C/3 

C/1 

03 

03 

03 

< 


o 

EE 

CO 


d 














# 






oo 





© 


© 







CM 











l> 









m. 






cn 





cn 


in 







d 






cn 


1 



cn 


m 


1 

1 

1 

1 

1 

m 


1 


1 


— 





— 


i — 







— 






cri 

£^ 




o 


z 

EE 






in 






m- 

• rH 




in 


in 

ME 






in 03 






in 

CO 

<0 ^ 

C/3 O 




in 

cn 

« 3 

c/i o 

in 

m 

O 

- s dn 






m ce 
cn cs 






r- 

a a 




r- 

a a 

r- 

ME Qh 






r- b 

EE 





o 

1 

O Oh 

OE >. 




o 

1 

o ceh 
‘-7 

o 

1 

2 o 
B ME 






°i a 

• rH 

B 





Oh 

Oh 75 




Oh 

Oh 75 

Oh 

H >» 






Oh 03 

o 





I z 

<u 2 

Vh © 

1 



z 

<u ° 

Vh Oh 

z 

C/5 bX) 

1 

1 

1 

1 

1 

z a 

Vh 

Oh 

I 


1 


d 





# 















cn 





CM 


CM 







Os 


cn 




oo 





OO 









cn 


oo 




in 





in 









d 


o 




cn 


1 



cn 


m 


1 

1 

1 

1 

1 

nl- 


-7- 


1 


oo 

£E 

• rH 

0 

M—» 




Co 

EE 

• 

OJ 

CM 







Z EE 

Z 3 


oo 




T—H 

g 




T - 1 

M—» 

CM 







CM O 


CM 




1 in 

Qh 




in 

2 

in 

X 






•n ^ 


in -a 




oo 

>-> 




oo 

Qh 

oo 







OO »H 


00 ’7 




cn 





cn 

>-» 

cn 







cn 03 


n, 




o 

o 




o 


o 

O 






O EE 






cn 

Qh 




cn 

o 

cn 

S-H 






cn cs 


cn 2 




O 

_o 




o 

Oh 

o 

Cd 






© 3 


O o 




O 

o3 




o 

cd 

o 

OJ 






°, ■§ 


O, aj 




1 





1 


1 







i a 


1 75 




Oh 

£ 




Oh 


Oh 

• 






Oh 03 


Oh B 




z 

rH 

o 

1 



z 

HH 

o 

z 

Qh 

C/5 

1 

1 

1 

1 

1 

z a 


Z c 


1 


d 


# 














# 




cn 


nl- 



o 


oo 









cn 




CM 


cn 



h-H 


CM 









oo 




oo 


in 



od 


z 









o 




© 


in 



cn 


nl- 


1 

1 

1 

1 

1 

i 


n|- 


1 


— 


— 



— 


— 









i * ♦ 




od 


od 



od 


oo 

C 

• f-H 








CM --h 
^ n. 




cn 

• rH 

cn 


• ^ 

cn 

• t-H 

cn 

O 











CM 

cn 

0 
M—> 

o 

CM 

cn 


OJ 

M—> 

o 

CM 

cn 

OJ 

M—> 

o 

CM 

cn 

M—» 

o 

Vh 








<71 ^ 
cn 2 




r- 

X) b 

r- 

X) 

Vh 

c- 


r- 

Cd 








e- o 




r—H 

cs 2^ 

•—i 

03 

Oh 

*—i 


’—i 

0 O 








03 




1 


l 


i-^T* 

1 

H >> 

l 

zZ O 








1 71 




Oh 

z 

o ex 

Oh 

z 

Ch 

o 

o 

ex 

Oh 

z 

O 

O Om 

Oh 

z 

'X ^ 
to) 

1 

1 

1 

1 

1 

i 


Oh B 

z g 


1 


d 




















os 





c- 















© 





oo 















in 





cn 















© 


1 



cn 


1 


1 

1 

1 

1 

1 

i 


1 


1 


CM 





CM 















d 

£3 




od 















oo 

• i-H 




© 

• rH 














c- 

cn 

<0 3 

c/i O 




r- 

cn 

« 3 














o 

ce a 




o 

cd a 














o 

o © 




o 

O Oh 














1 

OE >. 




1 

•-EE En hh 














Oh 

z 

C "o 

Vh CX 

1 



Oh 

z 

rep 
pol 
1 ah 

1 


1 

1 

1 

1 

1 

i 


1 


1 







# 















CM 





in 









y—i 











oo 









OO 






OO 





in 









cn 






© 


1 



cn 


] 


1 

1 

1 

1 

1 

cn 


1 


1 


CM 





CM 









P 






l> 





Z 

£3 








cn 






© 

• t-H 




H- 

• i-H 








m -h 






Oo 

e'¬ 

<0 B 

c/i O 




o\ 

r- 

« 3 

c/i o 








© .a 

t 03 






en 

a 




cn 

ce a 








cn EE 






CO 

O Oh 




© 

O Oh 








© 2 






1 

OE >> 




1 

CE >. 








Oh 






Oh 

Oh "3 




Oh 

Oh "3 








Oh ZT 






I z 

2 

Vh © 

] 



z 

ME 2 

In Oh 

1 


1 

1 

1 

1 

1 

Z S 


1 


1 


d 














# 






CO 


nf- 



CM 









in 


© 




oo 


CO 



oq 









nt-_ 


so 




CM 


in 



CM 









in 


oo 




© 


n|- 



nl" 


1 


1 

1 

1 

1 

1 

m- 


cn 


1 


CM 


— 



— 









Z EE 


rrH 




Co 


o 


£3 

o 

£3 








in mo 


* lj 
00 

m S 




© 

• rH 

in 


• rH 

in 









in o 





oo 

<u 

oo 


OJ 

oo 

OJ 








OO Vh 


OO o 




oo 

o 

oo 


M— > 

0 

oo 

M— > 

Q 








OO Oh 





CM 

O 5-h 

CM 


Vh 

CM 

Vh 








CM u 
oo .p 

i s 


CM o 

EE 



oo 


oo 

cd 

ex 

oo 

CE Oh 









OO H) 

• rH 

0 



1 

t-h En 

1 



1 

t—h E-. 









1 71 




Oh 

I z 

G ’o 
o © 

Oh 

z 

^+H 

o 

o 

ex 

Oh 

z 

G O 

O Oh 

1 


1 

1 

1 

1 

1 

Oh 2 

z a 


Oh B 
Z g 

O 

Vh 

ex 

1 



£E 

• 

OJ 





EE 

• 

OJ 


c 

03 

HH 

EE 

• 

03 

H—» 

EE 

03 

HH 

o 

Vh 

EE 

03 

HH 

o 

EE 

• t—H 

03 

HH 

03 

03 

EE 


EE 

• rH 

03 

H-* 

o 

Vh 





M—> 

2 

© 





HH 

2 

Oh 

Spike 

o 

Vh 

Oh 

o 

03 

o 

Oh 

Oh 

cn 

Oh 

< 

nl- 

Oh 

G 

m- 

o 

Vh 

Oh 

in 

Oh 

O EE 

13 '53 

> 3-> 

•2 -5 

a b 


Oh 

O 

B 

13 


tO 

oo 

Ph 

EE 

* H 1 

03 

H —> 


'o 

© 




< 

'o 

Oh 

.d* 

'Hb 

cn 
Z 

cn 

Z 

cn 

Z 

cn 

Z 

e g 

W Oh 

03 o 

S Sh 


E3 

Z 


z 

o 

o 

Vh 

Oh 








— 













CM 





cn 


d 


in 

d 


oo 

Os 

o 


z 


CM 


O 





o 


o 


o 

o 

o 

o 

O 



t—H 


T- 1 


CM 





CM 


CM 


CM 

CM 

CM 

CM 

CM 

CM 


CM 


CM 


r- 





C- 


C- 


c- 

r- 

C- 

r- 

r- 

r- 


C- 


r- 


© 





nl- 


nl- 


nt - 

nl- 

n|- 

nl- 

m- 

n|- 


nf- 


nt - 


O 





O 


O 


© 

O 

O 

O 

© 

© 


O 


© 


CO 





CO 


o\ 


© 

© 

© 

o\ 

© 

© 


© 


Os 


o 





© 


© 


© 

© 

O 

© 

© 

© 


© 


© 


© 





©, 


©, 


©, 

©, 

©, 

©, 

©, 

©, 


©, 


o, 


Oh 





Oh 


oj 


oj 

Oh 

Oh 

Oh 

Oh 

Oh 


Oh 


Oh 


z 





z 


Z 


Z 

Z 

Z 

Z 

Z 

Z 


Z 


Z 


















O 









CM 


cn 


nl" 

in 

OO 

C- 

OO 

Os 







CD 

1 
■ 3 —> 

CD 

-G 


O 

-G 

C/3 

C/3 

• 1-H 

C/5 

G 

a 

& 

C/3 

■ 3 —> 

G 

CD 

5-1 

.CD 


73 

G 

73 

CD 

G 

o3 

B—> 

G 

O 

C/3 

W) 

o 

o 

d 

o 

cm 

O 

'-M 

G 

CD 

73 

• rH 

73 

G 

c3 

5-H 

CD 

I 

G 

G 

G 

O 

• M 

C/3 

C/3 

CD 

CD 

CD 

c3 

<D 

-G 

H 



15 Advanced In Silico Tools for Designing of Antigenic Epitope as Potential... 


341 


Human corc'flj.irvs MB 


- ScaiojptoluB bal coranavinis 512 

. Bi' cwenjfltis COPHE15JVSW2M6 

- 6sl tortn^rus IS 

- Sst c-Difflia'.'jfus 1A 
-BrtCMiifcitus HKUS 

- Rmis^cus bst cofonwRjs HXUtQ 
■ Bal tttwswuJ HKU 2 

-[.riiSuorcnsvims strain '.70112? 


■ Beluga VVnale uronawiK SVV1 


H. 


Turftei r -corcn^-ru $ 

-Thrush corona' ius HKU12-6C0 
-Wbrte-eye corsnj.irus KKU16 
=Wunra corcnff f :nis KK1J13-3514 
—Magpie-robin corwiawms KiOJlS 
Porcine townmis HKU15 
- Sparrvw cncuia^uus HKU1? 

. Corranon-moc^sn corwn^irjs HXUEt 
—KljptJieffln wraiWitftjs H Kill 9 
—VYgeoa CDron^irus HKU20 


BARS coronftius 


Sat coffliftiftis 6VJ3-3tiBGR/2D0S 
—. Eat Hp-teL3CC‘. r iDn3 , ,7MS, ( Zh,e;ar.f20l3 
— Bn tc£0(ij:ms HKU9-1 


j- Sat coronas [BtCoV/l 33 j' 2055 j 
L Bataraim’JiisHKUM 
—- 8n co(M*w$ HKUS-1 
—j. r ;ifjle East resp ratify syndrsmg tcrciainis 
— BelaeoreiUKtui ErjattusJW£/DEW 20 f 2 isolate Eri^cKisC# 2 (H 2 - 2 ifrGLR' 2 Qi: 
—- Human cc;Dna'.ifu$ HK01 


-Rat corcns.ims Parker 

- Belaccronj.-irus HKU2J Mrjn HKUZIRD5W5I 


—Rabbi: cwraarinis HKU1J 
— Equine cQrcfij.injs 
r Human torcns.irus OCiS 
Human enteric coraramjs sLfaira 44 03 


Benin* coroflftiru* 

Bwne respiratory corsnsvirus AH187 

B^ine respratay uxomryj bT.in*iUSfOH44®TDf9K 


Human c^rtn^rus 229c 


Fig. 15.1 Phylogenetic tree of 40 different strains of coronavirus using whole genome sequences 
(Alignment of genome sequences is done using ClustalW, and tree is created using NJ method from 
Unipro UGENE 1.15.1 bioinformatics toolkit) 


hydrophobic proteins. The proteins with less than 40 value of instability index are 
quite stable than those with higher values. All the proteins are having the molecular 
weight less than 110 kDa except 3 (YP_009047202.1, YP_009047203.1 and 
YP_009047204.1). This exhibits the effectiveness of lightweight proteins as targets 
as they can be easily purified because of their low molecular weights. The protein 
YP_009047204.1 is reported as a spike glycoprotein. It is acidic with prominent 
negative charge, with negative GRAVY score which suggests its hydrophilicity and 
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YPO09O4 7205.1 
YP_Q09G472QG.l 
YP 009047207.1 
YP 009047208.1 



Host e ado plasmic Host cell Host cel Host cytoplasm unrecognrzed 

reticulum membrane membrane. Host 

endoplasmic 

reticulum 

SubceUular Localization 

Fig. 15.2 Subcellular localization of seed genome proteins predicted using Virus-mPLoc 


presence on surface. However the envelope protein YP_009047209.1 and membrane 
protein YP_009047210.1 are basic and hydrophobic. 


15.5.3 Subcellular Localization Predictions 

Figure 15.2 depicts the subcellular localization of proteins of the seed genome, i.e. 
MERS-CoV. Only one protein was predicted to be localized in host cytoplasm, 
four in host membrane, two in both host cell membrane and endoplasmic reticu¬ 
lum (ER) while two in only ER, and two are left unrecognized. The known spike 
protein is predicted to be localized in host ER. From these results we decided to 
pick the proteins which are located in host membrane or were predicted to be 
localized in both host membrane and ER. The two are known envelop protein and 
membrane protein from bibliographic studies, and along with that, the known 
spike protein was also included in the filtered results. Out of the filtered proteins, 
only two (YP_009047210.1 and YP_009047208.1) contain more than two trans¬ 
membrane helices, therefore filtered out. The results of transmembrane helices 
prediction are tabulated in Table 15.6. Figure 15.3 depicts the subcellular 
localization of proteins of all the four selected genomes using Virus-mPLoc 
prediction tool. 




Table 15.6 Subcellular Localization prediction results using Virus-mPloc 
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Fig. 15.3 Subcellular localization of proteins of all four selected genomes predicted using Virus- 
mPLoc 


15.5.4 Signal Peptides 

The proteins that are predicted to possess the signal peptides by Signal-BLAST web 
server are YP_009047204.1 and YP_009047205.1. The results of Signal-BLAST 
web server are tabulated in the Table 15.7. 


15.5.5 Adhesion Probability 

This step takes into account the concept of adhesion-based virulence. Adhesions 
cause pathogen recognition and initiation of inflammatory responses by the host. 
SPAAN predicted 2 (YP_009047204.1 and YP_009047205.1) out of 11 proteins of 
MERS strain as adhesive (Table 15.8). 


15.5.6 BetaWrap 

Only one protein (YP_009047204.1) was predicted to contain BetaWrap motifs 
within it (Table 15.8). Hence, it is considered virulent and might be responsible 
for initializing the infection in the host. 
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Table 15.7 The signal peptide prediction results for proteins of MERS coronavirus strain 


S. no. 

Accession no. 

Signal blast 
(Sensitivity) 

Specificity 

Balanced 

prediction 

Cleavage site 

1 

YP_009047202.1 

No 

No 

No 

Yes 

2 

YP_009047203.1 

No 

No 

No 

Yes 

3 

YP_009047204.1 

Yes 

Yes 

Yes 

Yes 

4 

YP_009047205.1 

Yes 

Yes 

Yes 

Yes 

5 

YP_009047206.1 

No 

No 

No 

Yes 

6 

YP_009047207.1 

No 

No 

No 

Yes 

7 

YP_009047208.1 

No 

No 

No 

Yes 

8 

YP_009047209.1 

No 

No 

No 

No alignment found, 
unable to predict 

9 

YP_009047210.1 

No 

No 

No 

No alignment found, 
unable to predict 

10 

YP_009047211.1 

No 

No 

No 

Yes 

11 

YP_009047212.1 

No 

No 

No 

No alignment found, 


unable to predict 


Table 15.8 Table illustrating the prediction results made for selecting adhesion proteins using 
SPAAN, BetaWrap predictions and antigenicity predictions using Vaxijen version 2.0 


S. no 

Accession no. 

Adhesion probability 

P-value 

Vaxijen value 

TMHMM 

1 

YP_009047202.1 

0439813 

No 

04908 

14 " 

2 

YP_009047203.1 

0.442577 

No 

0.4884_J 

14 

3 

YP_009047204.1 

0.634711 

0.0046 

0.4849 

1 

4 _ 

YP_009047205.1 

0.635586 

No 

0.4226 

0 

5 

YP_009047206.1 

0.44212 

No 

0.3288 

0 

6 

YP_009047207.1 

0.269269 

No 

0.4978 

0 

7 

YP_009047208.1 

0.237608 

No 

0.3369 

3 

8 

YP_009047209.1 

0.389879 

No 

0.5119 

1 

9 

YP_009047210.1 

0.461965 

No 

0.5503 

3 

10 

YP_009047211.1 

0.690125 

No 

0.6036 

0 

11 

YP_009047212.1 

0.342692 

No 

0.6078 

0 


The transmembrane prediction results using TMHMM are also tabulated 
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15.5.7 VaxiJen 2.0 

A total of 9 out of 11 proteins of MERS strain were predicted antigenic (prediction 
values greater than 0.4). The protein with accession number YP_009047206.1 and 
YP_009047208.1 were among the filtered proteins, however, not predicted 
antigenic, therefore filtered out. As a result, only four proteins 
(YP_009047204.1, YP_009047205.1, YP_009047207.1 and YP_009047209.1) 
were kept for further analyses. 


15.5.8 AlgPred and Allertop 

None of the 11 proteins of MERS-CoV possessed any clue of allergenicity as per 
prediction results from AlgPred and Allertop tools; it means that no vigorous 
immune responses will be mounted if the epitopes from these proteins will be 
adopted as vaccine candidates. 


15.5.9 Similarity with Host Proteome 

None of the protein of MERS strain shows similarity with the proteins of host that 
demonstrates that the epitopes from these proteins can safely elicit the required 
immune response without the hazard of autoimmunity. 


15.5.10 Epitope Mapping 

In total 12 different 9-mer epitopes with potential to bind to receptors of both B-cell 
and T-cell were predicted. The list of the predicted epitopes can be found in the 
Table 15.9 and are specific for MERS-CoV strain. All these epitopes displayed no 
conservancy with proteins of other human and non-human pathogenic strains. 


15.5.11 Docking Analysis 

Docking permits to reveal the binding energy or potency of connection among 
epitopes and the receptor in appropriate orientation. The ClusPro docking server 
was used to dock the predicted 90 epitopes against HLA-A*0201. The structure of 
the receptor was available from PDB and was optimized before docking to free it 

o 

from the complexed self-peptide (4U6Y, Resolution 1.47 A, Bouvier et al. 1998). 
PEPstr (Peptide Tertiary Structure Prediction Server; Kaur et al. 2007) was used to 
derive the tertiary structure of the predicted peptides. 
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Fig. 15.4 3D structure of 
receptor site of HLA-A*0201 
visualized using Swiss PDB 
viewer 4.10. The residues 
shown in globular structure 
are known to be conserved 
and form hydrogen bonds 
with the binding peptides 



Figure 15.4 depicts the quaternary structure of the receptor HLA-A*0201 with its 
conserved active site known to form complex with the peptides (Bouvier et al. 1998). 
The binding energy results obtained after performing docking analysis are listed in 
Table 15.9. 

The 9-mer epitope VVCAITLLV at site 21 of protein YP_009047209.1 docked 
to the receptor with smallest amount of binding energy (—951.7) and 12 hydrogen 
bonds. The next epitope in the list was also from the same protein 
YP_009047209.1 at site 27, i.e. TLLVCMAFL. The predicted structure of the top 
5 potent epitopes on the basis of docking energy and the snapshots of docking results 
are displayed in Figs. 15.5, 15.6, 15.7, 15.8 and 15.9. 

The most chief restriction for developing a safe and sound vaccine against any of 
the virus is to identify the protective antigens. The present study is an effort of 
application of reverse vaccinology approach to investigate a choice of coronavirus 
proteomes to identify possible vaccine targets. This technique has demonstrated to 
be a competent way to forecast 12 different epitopes from the selected seed genome. 
These epitopes are from spike glycoprotein, NS3 protein, NS4B protein and enve¬ 
lope protein. Unfortunately none of the epitope is found conserved in other strains, 
and all are specific to MERS-CoV. The docking analysis studies revealed perfect 
binding between HLA-A*0201 receptor and epitopes. The conserved residues of the 
receptor site are also involved in H-bonding with epitope residues. Further, the 
selected antigenic epitopes must be validated using in vitro and in vivo studies to 
confirm their potential as vaccine candidates. 



350 


M. Dangi et al. 


Fig. 15.5 (a) 3D Structure of 
the 9-mer epitope starting 
from 21 (VVCAITLLV) 
position of protein 
YP_009047209.1 (b) 

Docking results of epitope 
“VVCAITLLV” with A chain 
of HLA-A*0201 using 
ClusPro. (c) The snapshot 
representing the epitope 
docked in the pocket of 
molecular surface of the 
receptor (all the structures are 
visualized using Chimera 
1 . 10 . 1 ) 





C 
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Fig. 15.6 (a) 3D Structure of 
the 9-mer epitope starting 
from 27 (TLLVCMAFL) 
position of protein 
YP_009047209.1. (b) 
Docking results of epitope 
“TLLVCMAFL” with A 
chain of HLA-A*0201 using 
ClusPro. (c) The snapshot 
representing the epitope 
docked in the pocket of 
molecular surface of the 
receptor (all the structures are 
visualized using Chimera 
1 . 10 . 1 ) 



C 
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Fig. 15.7 (a) 3D Structure of 
the 9-mer epitope starting 
from 716(GLVNSSLFV) 
position of protein 
YP_009047204.1. (b) 
Docking results of epitope 
“GLVNSSLFV” with A chain 
of HLA-A*0201 using 
ClusPro. (c) The snapshot 
representing the epitope 
docked in the pocket of 
molecular surface of the 
receptor (all the structures are 
visualized using Chimera 
1 . 10 . 1 ) 



C 
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Fig. 15.8 (a) 3D Structure of 
the 9-mer epitope starting 
from 18(YVDVGPDS V) 
position of protein 
YP_009047204.1. (b) 
Docking results of epitope 
“YVDVGPDSV” with A 
chain of HLA-A*0201 using 
ClusPro. (c) The snapshot 
representing the epitope 
docked in the pocket of 
molecular surface of the 
receptor (all the structures are 
visualized using Chimera 
1 . 10 . 1 ) 
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Fig. 15.9 (a) 3D Structure of 
the 9-mer epitope starting 
from 160(KMGRFFNHT) 
position of protein 
YP_009047204.1. (b) 
Docking results of epitope 
“KMGRFFNHT” with A 
chain of HLA-A*0201 using 
ClusPro. (c) The snapshot 
representing the epitope 
docked in the pocket of 
molecular surface of the 
receptor (all the structures are 
visualized using Chimera 
1 . 10 . 1 ) 
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